Solving Constrained Combinatorial Optimisation Problems via MAP Inference without High-Order Penalties
نویسندگان
چکیده
Solving constrained combinatorial optimization problems via MAP inference is often achieved by introducing extra potential functions for each constraint. This can result in very high order potentials, e.g. a 2-order objective with pairwise potentials and a quadratic constraint over all N variables would correspond to an unconstrained objective with an order-N potential. This limits the practicality of such an approach, since inference with high order potentials is tractable only for a few special classes of functions. We propose an approach which is able to solve constrained combinatorial problems using belief propagation without increasing the order. For example, in our scheme the 2-order problem above remains order 2 instead of order N . Experiments on applications ranging from foreground detection, image reconstruction, quadratic knapsack, and the M-best solutions problem demonstrate the effectiveness and efficiency of our method. Moreover, we show several situations in which our approach outperforms commercial solvers like CPLEX and others designed for specific constrained MAP inference problems. Introduction Maximum a posteriori (MAP) inference for graphical models can be used to solve unconstrained combinatorial optimization problems, or constrained problems by introducing extra potential functions for each constraint (Ravanbakhsh, Rabbany, and Greiner 2014; Ravanbakhsh and Greiner 2014; Frey and Dueck 2007; Bayati, Shah, and Sharma 2005; Werner 2008). The main limitation of this approach is that it often results in very high order potentials. Problems with pairwise potentials, for example, are very common, and adding a quadratic constraint (order 2) over N variables results in an objective function of order N . Optimizing over such high-order potentials is tractable only for a few special classes of functions (Tarlow, Givoni, and Zemel 2010; Potetz and Lee 2008; Komodakis and Paragios 2009; Mézard, Parisi, and Zecchina 2002; Aguiar et al. 2011), such as linear functions. Recently, Lim, Jung, and Kohli (2014) proposed cuttingplane based methods to handle constrained problems without Copyright c © 2017, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. introducing very high order potentials. However, their approaches require exact solutions of a series of unconstrained MAP inference problems, which is, in general, intractable. Thus their approaches are again only applicable to a particular class of potentials and constraints. To tackle general constrained combinatorial problems, our overall idea is to formulate the unconstrained combinatorial problem as a linear program (LP) with local marginal polytope constraints only (which corresponds to a classical MAP inference problem), and then add the “real” constraints from the original combinatorial problem to the existing LP to form a new LP. Duality of the new LP absorbs the “real” constraints naturally, and yields a convenient message passing procedure. The proposed algorithm is guaranteed to find feasible solutions for a quite general set of constraints. We apply our method to problems including foreground detection, image reconstruction, quadratic knapsack, and the M-best solutions problem, and show several situations in which it outperforms the commercial optimization solver CPLEX. We also test our method against more restrictive approaches including Aguiar et al. (2011) and Lim, Jung, and Kohli (2014) on the subsets of our applications to which they are applicable. Our method outperforms these methods in most cases even in settings that favor them. Preliminaries Here we consider factor graphical models with discrete variables. Denote the graph G = (V,C), where V is the set of nodes, and C is a collection of subsets of V. Each c ∈ C is called a cluster. If we associate one random variable xi with each node, and let x = [xi]i∈V, then it is often assumed that the joint distribution of x belongs to the exponential family p(x) = 1 Z exp [∑ c∈C θc(xc) ] , where xc denotes the vector [xi]i∈c. The real-valued function θc(xc) is known as a potential function. Without loss of generality we make the following assumption to simplify the derivation: Assumption 1. For convenience we assume that: (1) C includes every node, i.e., ∀i ∈ V, {i} ∈ C; (2) C is closed under intersection, i.e., ∀c1, c2 ∈ C, it is true that c1 ∩ c2 ∈ C. MAP and its LP relaxations The goal of MAP inference is to find the most likely assignment of values to the random variable x given its joint distribution, which is equivalent to x∗ = argmaxx ∑ c∈C θc(xc). The MAP inference problem is NP-hard in general (Shimony 1994). Under Assumption 1, by introducing μ = [μc(xc)]c∈C corresponding to the elements of C one arrives at the following LP relaxation (Globerson and Jaakkola 2007): max μ∈L ∑ c∈C ∑ xc μc(xc)θc(xc), (1) L = { μ ∣∣∣∣ ∀c ∈ C,xc, μc(xc) > 0,∑xc μc(xc) = 1, ∀c, s ∈ C, s ⊂ c,xs,∑xc\s μc(xc) = μs(xs) } . The Problem We consider the following constrained combinatorial problem with K constraints: max x ∑ c∈C θc(xc), s.t. ∑ c∈C φc (xc) 6 0, k ∈ K . (2) where K = {1, 2, . . . ,K}, and φc (xc) are real-valued functions. Since θc(xc) and φc (xc) can be any real-valued functions, problem (2) represents a large range of combinatorial problems. We now provide a few examples. Example 1 (`0 norm constraint). Given b ∈ R, ‖x ‖0 6 b can be reformulated as ∑ i∈V φi(xi) 6 0, φi(xi) = 1(xi 6= 0)− b/|V |, (3) where 1(S) denotes the indicator function (1(S) = 1 if S is true, 0 otherwise). This type of constraint is often used in applications where sparse signals or variables are expected. Example 2 (Sparse gradient constraint). Some approaches to image reconstruction (e.g. Maleh, Gilbert, and Strauss (2007)) exploit a constraint reflecting an expectation that image gradients are sparse, such as ∑ {i,j}∈C ‖xi − xj‖0 6 b, where b is a threshold. The constraint can be rewritten as ∑ i∈V φi(xi) + ∑ {i,j}∈C φi,j(x{i,j}) 6 0, (4) φi(xi) = −b/|V |, φi,j(x{i,j}) = 1(xi = xj). Example 3 (Assignment difference constraint). Image segmentation (Batra et al. 2012) and the M-best MAP solutions problem (Fromer and Globerson 2009), often require the difference between a new solution x and a given assignment a to be greater than a positive integer b, i.e. ‖x−a ‖0 > b. This can be reformulated as ∑ i∈V φi(xi) 6 0, φi(xi) = −1(xi 6= ai) + b/|V |. (5) For b = 1, it can also be reformulated as ∑ i φi(xi) + ∑ {i,j}∈T φi,j(xi, xj) ≤ 0, (6) φi(xi)= { 1− di, xi=ai, 0, xi 6=ai, φij(xi, xj)= { 1, xij =aij , 0, xij 6=aij , where T is an arbitrary spanning tree of G, and di is the degree of node i in T (Fromer and Globerson 2009). Example 4 (QKP). The Quadratic Knapsack Problem with multiple constraints (Wang, Kochenberger, and Glover 2012) is stated as max x∈{0,1}|V | ∑ i,j∈V xicijxj , s.t. ∑ i∈V w i xi 6 bk, k ∈ K, where all cij and w i are non-negative real numbers. The above QKP can be reformulated as max x ∑ i∈V θi(xi) + ∑ {i,j}∈E θij(xi, xj), s.t. ∑ i∈V φi (xi) 6 0, φi(xi) = w k i xi − b/|V |, k ∈ K where θi(xi) = ciixi, E = {{i, j}|cij + cji > 0} and θij(xi, xj) = (cij + cji)xixj . Belief Propagation with Constraints without the High Order Penalties Problem (2) is NP hard in general, thus we first show how to relax the problem. LP Relaxations We consider the following LP relaxation for (2), μ∗ = argmax μ∈L ∑ c∈C ∑ xc μc(xc)θc(xc), (7)
منابع مشابه
Efficient Sampling from Combinatorial Space via Bridging
MCMC sampling has been extensively studied and used in probabilistic inference. Many algorithms rely on local updates to explore the space, often resulting in slow convergence or failure to mix when there is no path from one set of states to another via local changes. We propose an efficient method for sampling from combinatorial spaces that addresses these issues via “bridging states” that fac...
متن کاملThe preemptive resource-constrained project scheduling problem subject to due dates and preemption penalties: An integer programming approach
Extensive research has been devoted to resource constrained project scheduling problem. However, little attention has been paid to problems where a certain time penalty must be incurred if activity preemption is allowed. In this paper, we consider the project scheduling problem of minimizing the total cost subject to resource constraints, earliness-tardiness penalties and preemption penalties, ...
متن کاملSolving the Single Machine Problem with Quadratic Earliness and Tardiness Penalties
Nowadays, scheduling problems have a considerable application in production and service systems. In this paper, we consider the scheduling of n jobs on a single machine assuming no machine idleness, non-preemptive jobs and equal process times. In many of previous researches, because of the delivery dalays and holding costs, earliness and tardiness penalties emerge in the form of linear combin...
متن کاملConstraint Handling in Genetic Algorithms: The Set Partitioning Problem
In this paper we present a genetic algorithm-based heuristic for solving the set partitioning problem (SPP). The SPP is an important combinatorial optimisation problem used by many airlines as a mathematical model for flight crew scheduling. A key feature of the SPP is that it is a highly constrained problem, all constraints being equalities. New genetic algorithm (GA) components: separate fitn...
متن کامل